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Abstract: Exploratory plotting tools have been devised aplenty in order to diagnose the goodness-of- 
fit of data sets to a hypothesized distribution. Some of them have found extensive use in diverse areas 
of finance, telecommunication, environmental science, etc. in order to detect sub-exponential or heavy- 
tailed behavior in observed data. In this paper we concentrate on two such plotting methodologies: the 
Quantilc-Quantile plots for heavy-tails and the Mean Excess plots. Under the assumption of heavy- 
tailed behavior of the underlying sample the convergence in probability of these plots to a fixed set 
in a suitable topology of closed sets of R 2 has been studied in Das and Resnick (2008) and Ghosh 
and Resnick (2010). These results give theoretical justifications for using the plots to test the null 
hypothesis that the underlying distribution is heavy-tailed by checking if the observed plot is "close" 
to the limit under the null hypothesis. In practice though one set of observations would lead to only 
one plot of each kind. This is often not good enough to verify proximity of the plot to the fixed set of 
interest under the null hypothesis of heavy-tails. An appropriate confidence bound around the plots 
covering or not covering the fixed set would stand as a better indicator for such a purpose. In this 
paper we provide weak limits for Quantile-Quantile plots for heavy-tails and Mean Excess plots when 
the underlying distribution of the sample is heavy-tailed. As an application we are able to construct 
confidence bounds around the plots which enables us to check whether the underlying distribution is 
heavy-tailed or not. 
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1. Introduction 

Statistical analysis of extremes in available data has been quite important in a variety of areas like finance 
(McNeil et al., 2005), telecommunication (Maulik et al., 2002; D'Auria and Resnick, 2006), hydrology (Katz 
et al., 2002), environmental statistics (Davison and Smith, 1990; Smith, 2003) and many more. Prior to 
analyzing features of the data using extreme value analysis it is imperative that we check whether extreme- 
value modeling is well-suited in the given context. Popular exploratory techniques in this direction have been 
the Mean Excess (ME) plots (Davison and Smith, 1990) and the Quantile-Quantile (QQ) plots which are 
specifically tuned for heavy-tailed data (Kratz and Resnick, 1996). These plots are used to detect whether or 
not the data is heavy-tailed in the sense that the tails of the underlying distributions are regularly varying 
with some tail index — 1/£ where £ > (Resnick, 2008, Chapter 1). It has been shown earlier that under 
this assumption of regular variation of tails both the QQ plot for heavy-tails and the ME plot when suitably 
normalized converge in probability to a fixed closed set, which is a straight line through the origin in most 

*The authors are thankful to Paul Embrechts (ETH Zurich), Sidney I. Resnick (Cornell University) and Gennady Samorod- 
nitsky (Cornell University) for their detailed comments on a draft of the paper which greatly helped in improving the paper. 
Souvik Ghosh was partially supported by the FRAP program at Columbia University. 
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cases (i.e., except for the ME plot for data where the mean does not exist, in which case the limit is neither 
a straight line nor a fixed set); see Das and Resnick (2008) and Ghosh and Rcsnick (2010). These results 
give proper theoretical justification for the use of the two plots mentioned to test the hypothesis that the 
data comes from a heavy-tailed distribution. But it should be noted that one data set only leads to one 
plot of each kind. Clearly one plot is not enough to detect the proximity between the plots as a closed set 
and the intended fixed set under the assumption of heavy-tails. This is clarified with examples in Section 6. 
One way to get around this will be to provide confidence bounds around these plots and see if these bounds 
contain the fixed set of interest. It is also possible to bootstrap more data sets from the one that we have and 
superimpose these plots and see if it covers the fixed set of interest. The bootstrapping method is often used 
in practice but bootstrapping in a heavy-tailed set up needs additional care while using them, for details on 
the difficulties with this see (Rcsnick, 2007, Chapter 6.4). In our approach we take the first route of finding 
confidence bounds for these plots. First we study weak limits of the plots for heavy-tailed data and then 
using these limits we describe methods of obtaining confidence bounds for both the QQ plots for heavy-tails 
and the ME plots. 

1.1. Plan for this paper 

We start by introducing the QQ plot and the ME plot in Section 1 and by explaining the problem we are 
looking at. The plots that we talk about in this paper can all be visualized as random closed sets in M 2 , hence 
notions of convergence also must be described in an appropriate sense here. In Section 2 we set up necessary 
tools to talk about convergence of random closed sets in K 2 . Also note that both the QQ plot and the ME plot 
depend on the tail empirical measure of the sample, that is, both the plots depend on a portion of the data 
that are made out of the upper order statistics of the sample. So we will provide distributional convergence 
results for the plots under the null hypothesis of the tails of the underlying distributions being regularly 
varying using convergences of appropriate functionals of the tail empirical measure. This is discussed in 
detail in Sections 3 and 4. Now, as an application to the weak limits we obtained for these plots under the 
null hypothesis, we construct confidence bounds for them in Section 5. Finally, in Section 6 we use the results 
obtained in the previous sections to construct confidence bounds on simulated and real data sets to exemplify 
how they perform in practice. We conclude in Section 7 along with a discussion on future directions. 

1.2. QQ plots for heavy-tails 

Suppose we want to test the null hypothesis that observations from a sample are i.i.d. from some known 
distribution F. One quite intuitive way to go is to make a QQ plot which is a plot of the empirical quantilcs 
from the data against the distributional quantiles of F. The QQ plot has been popular for graphically 
detecting the goodness-of-fit for a sample to the distribution F. If the true distribution of the sample is F then 
the QQ plot should converge, in an appropriate sense, to a straight line. Results involving empirical process 
and quantile process convergences are available in Shorack and Wcllncr (1986) which can be appropriately 
used to create confidence intervals for QQ plots. The QQ plot we consider is a little different and specifically 
designed to check for distributions F where 1 — F is regularly varying with some index — 1/£, £ > 0, also 
denoted 1 — F e RV_i/^ (Resnick, 2008, Chapter 1). For a sample X i} X 2 , ■ ■ ■ , X n , its decreasing order 
statistics is denoted by Xm > Xr 2 ) > • - • > -^(n) an d the QQ plot in this context is defined by 

Q n = {(-log|,log^) :l<i<fc}, k<n. 

Clearly we concentrate on the top k quantiles of the data justified by the fact that RV_i/£ only 

provides us with information about the right tail of the data. Under the null hypothesis of 1 — F e RV_i/£ 
for some £ > 0, Das and Resnick (2008) has shown convergence in probability for QQ plots in an appropriate 
topology of random closed sets when the data is assumed to be an independent and identically distributed 
(i.i.d.) sample. 



1.3. ME Plots 
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The ME function of a random variable X is denned as: 

M{u) :=E[X-u\X > u], (1.1) 

provided EX + < oo, and is also known as the mean residual life function. A natural estimate of M(u) is the 
empirical ME function M(u) defined as 

M{u) = =^ — , u > 0. (1.2) 

Z^i=l 2 [X z >u] 

The ME plot is the plot of the points {(X( k ),M(X( k ))) :l<k<n}. 

The ME plot is often used as a simple graphical test to check if data conform to a generalized Pareto 
distribution(GPD). The GPD is an important class of distributions and is fundamental for the peaks over 
threshold method. The GPD is characterized by its cumulative distribution function G^ t p : 

where f3 > 0, and x > when £ > and < x < — /3/£ if £ < 0. The parameters £ and j3 are referred to as 
the shape and scale parameters respectively. For a Pareto distribution, the tail index a is just the reciprocal 
of £ when £ > 0. 

For a random variable X ~ G$ t p , we have E(X) < 00 iff £ < 1 and in this case, the ME function of X is 
linear in u: 

M(u) = + T^ u ' (1-4) 

where 0<u<ooif0<£<l and < it < — /3/£ if £ < 0. In fact, the linearity of the ME function 
characterizes the GPD class, cf. McNeil et al. (2005); Embrechts et a!. (1997). Davison and Smith (1990) 
used this property and suggested that if the ME plot is close to linear for high values of the threshold then 
there is no evidence against the use of a GPD model. See also Embrechts et al. (1997) and Hogg and Klugman 
(1984) for the implementation of this plot in practice. Ghosh and Resnick (2010) discusses convergence in 
probability for the high thresholds of suitably normalized ME plots in an appropriate topology of random 
closed sets when the data is an i.i.d. sample. 

The advantage of the ME plot over the QQ plot is that it works when — 00 < £ < 1, i.e., when the sample 
is in the maximal domain of attraction of the Weibull or the Gumbel. In this paper though we restrict to 
the case when £ > which is the case of maximal domain of attraction of the Frechet distribution. The 
disadvantage of the ME plot is that it requires £ < 1 to make proper sense of the result, in other words, for 
the ME plot to make sense the underlying distribution should have a finite mean, although limits can and 
has been obtained for the ME plots even when the distributional mean is not finite. 



2. Preliminaries 



2.1. Topology on closed sets of W 2 

Since we are dealing with plots which are closed sets in K 2 it is imperative to understand the topology on 
closed sets. We denote the collection of all closed (compact) sets in K 2 by T (JC respectively). We consider 
a hit and miss topology on T called the Fell topology. The Fell topology is generated by the families 
\T K , K compact} and {J^g, G open} where for any set B 

T B = {F e T : FOB = 0} and F B = {F e F : F n B ^ 0} 

Hence F B and Tb are collections of closed sets which miss and hit the set B, respectively. This is the reason 
for which such topologies are called hit and miss topologies. In the Fell topology a sequence of closed sets 
{F n } converges to F e F if and only if the following two conditions hold: 
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• F hits an open set G implies there exists N > 1 such that for all n > N, F n hits G. 

• F misses a compact set K implies there exists N > 1 such that for all n > N, F n misses K. 

The Fell topology on the closed sets of M 2 is metrizable and we indicate convergence in this topology of a 
sequence (F n ) of closed sets to a limit closed set F by F n —> F. Often though, it is easier to deal with the 
following characterization of convergence. 

Lemma 2.1. A sequence F n £ J- converges to F € T in the Fell topology if and only if the following two 
conditions hold: 

1. For any t £ F there exists t n £ F n such that t n — > t. 

2. If for some subsequence (m n ), t mn € F rrln converges, then lim t mn £ F. 

See Theorem 1-2-2 in (Matheron, 1975, p. 6) for a proof of this Lemma. 

Let ajr denote the Borel er-algebra generated by the Fell topology of open sets (not to be confused with 
open sets in Mr). A random closed set X : >— > T is a measurable mapping from (CI, A, P ) to (F,ajr). 
Denote by P the induced probability on ajr, that is, P = P o X^ 1 . 

Since the Fell topology is metrizable, the definition of convergence in probability is obvious. The following 
result is a well-known and helpful characterization for convergence in probability of random variables and it 
holds for random sets as well; see Theorem 6.21 in (Molchanov, 2005, p. 92). 

Lemma 2.2. A sequence of random sets (F n ) in J- converges in probability to a random set F if and only 
if for every subsequence (n 1 ) ofL + there exists a further subsequence (n") of (n') such that F n » — > F-a.s. 

A sequence of random closed sets (X n ) n >\ weakly converges to a random closed set X with distribution 
P) if the corresponding induced probability measures (P n ) n >i converge weakly to P, i.e., 

P n {B) = P n o X-^B) -> P(B) =P'o X-\B), as n -> oo, 

for each B £ a? such that P(dB) = 0. This is not always straightforward to verify from the definition. The 
following characterization of weak convergence in terms of sup-measures is very useful, cf. (Vervaat, 1997). 
Suppose h : R d -> M + = [0, oo). For X C R d , define h(X) = {h(x) : x e X} and h v is the sup-measure 
generated by h defined by 

h v (X) = sup{h{x) :xeX} 

(Molchanov, 2005; Vervaat, 1997). These definitions permit the following characterization (Molchanov, 2005, 
page 87). 

Lemma 2.3. A sequence (X n ) n >i of random closed sets converges weakly to a random closed set X if and 
only ifEh v (X n ) converges to E/i v (X) for every non-negative continuous function h : R d — > R with a bounded 
support. 

We use the following notation: For a real number x and a set A C K™, xA — {xy : y d A} and 
x + A = {x + y : y G A}. 

Matheron (1975) and Molchanov (2005) are good references for the theory of random sets. 

2.2. Miscellany 

Throughout this paper we will take k :— k n to he a sequence increasing to infinity such that k n /n — > 0. For 
a distribution function F(x) we write F(x) := 1 — F (x) for the tail and the quantile function is 

b(u) := F«"(l - \) = inf | S : F(s) > 1 - ^} = (^J-^) *"(„). 

A function U : (0, oo) — > M. + is regularly varying with index peR, written U £ RV p , if 

U{tx) 
£5Zo U(t) 
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It is discussed in several books such as Resnick (2007, 2008); Seneta (1976); Geluk and de Haan (1987); 
de Haan (1970); dc Haan and Fcrrcira (2006); Bingham et al. (1987). 

We use M_|_(0, oo] to denote the space of nonnegative Radon measures fj, on (0, oo] metrized by the vague 
metric. Point measures are written as a function of their points {xi,i = 1, ...,n} by Y^—x S Xj . See, for 
example, (Resnick, 2008, Chapter 3). 

We will use the following notations to denote different classes of functions: For < a < b < oo, 

1. C[a,b): Continuous functions on [a,b). 

2. D[o, b): Right-continuous functions with hnitc left limits defined on [a, 6). 

3. Dj[a, b): Left-continuous functions with finite right limits defined on [a, b). 

D[0, 1] is complete and separable under a metric do(-) which is equivalent to the Skorohod metric ds{-) 
(Billingsley, 1968, page 128), but not under the uniform metric || • ||. As we will see later, the limit processes 
that appear in our analysis below are always continuous. We can check that if x is continuous (in fact 
uniformly continuous) in [0,1], for x n € D[0, 1], \\x n — x\\ — > is equivalent to ds(x n ,x) — > and hence 
equivalent to do(x n , x) — ¥ as n — > oo (Billingsley, 1968, page 124). So we use convergence in uniform metric, 
for our convenience henceforth. For spaces of the form D[a,b) or D/[a, b) we will consider the topology of 
locally uniform convergence. In some cases we will also consider product spaces of functions and then the 
topology will be the product topology. For example, Df[l, oo) will denote the class of 2-dimensional functions 
on [1, oo). The classes of functions defined on the sets [a, b) or (a, b] will have the obvious notation. 

3. Limit results for the QQ plots 

Convergence of empirical processes and quantile processes to functionals of Gaussian processes, usually 
Brownian motion and Brownian bridges are quite well-known, cf. Shorack and Wellner (1986). We prove 
similar results for extreme order statistics. We use the weak limit of tail empirical measure and deduce weak 
convergence of the logarithmic version of the QQ plot of the extreme order statistics as a random set. 

First we discuss some limit results in the context of QQ plots that are already known. The following was 
proved is Das and Resnick (2008): 

Proposition 3.1. Suppose X\, . . . ,X n are i.i.d. with common distribution F and -XVi) > -^(2) > ■ ■ • > ^(«) 
are the order statistics from this sample. If F is strictly increasing and continuous on its support, then 

T n := I (f^ (^i) ;!<*<n}4T := |(a:,x) : x £ support(F)} 

in T . 

The proposition was first proved for the case where F is the uniform distribution on [0, 1] and then it was 
extended to general distributions on R. This proposition though is not enough if one is interested in creating 
confidence bounds from the data. For that purpose one would need weak convergence results. These results 
are widely known when the distribution F is known. We briefly state the results below but would refer to 
(Shorack and Wellner, 1986, Chapter 3) for further details. 

3.1. QQ plots for a sample with known target distribution F 

We start with the simplest case. Suppose Ux, U2, ■ ■ ■ , U n are i.i.d. U(0,1). Denote the order statistics of this 
sample by Ur X ) > U(2) > • • > U^ n y then 

7; = {(^-j-,L/ (n _ i+1) ), l<z<n} and T = {(x, x) : < x < l}. (3.1) 
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Then Proposition 3.1 yields T n -r T in K,. Now consider the empirical process 



1 " 

Fn(t) = -22 I lUi<t] for 0<t<l. 



n 

i=l 



and the quantile process defined as 



G n {t) : = F^(t) = inf{z : F n (x) > t} 

% — 1 % 

— Ur n — for < t < — and 1 < i < n 

n n 

and G„(0) = 0. Let T* = {(*, G n {t)) : < t < 1}. Clearly T„ C T„* in K 2 . We can also check that T n a -4' T 
in J- is equivalent to 7^* ^4 T* in J 7 . 
Now keeping weak limit in mind, define 

B n (t) = Vn(G n {t)-t). 

Let B denote a Brownian Bridge on [0, 1], i.e., a Gaussian process on C[0, 1] with 

E(B(t))=0 and cov(B(s), B{t)) = s At — st V0<s,t<l. 

It is a simple extension of the CLT to check that B n B in finite dimensional distributional sense. In fact, 
we can define the uniform random variables Ui and B on the same probability space (f2,^4, P) such that 
\\B n — B\\ — > almost surely (Shorack and Wellner, 1986, Chapter 3). 

This result for the uniform case can be easily extended to the case where X\ 1 X2, ■ ■ ■ ,X n are i.i.d. from 
some distribution F which has density / = F' and also g — (F^ 1 )' . As in the uniform case we define: 

G n (t) := F^(t) = mf{x : F n (x) > t} 

i — \ i 

= Xr n _ i+1 \ for < t < — and 1 < i < n. 

y ' n n 

and G ra (0) = 0. Now define 

B n (t) = Vng(t)(G n (t) - G(t)). 

where G(t) := F^(t), < t < 1. Here again we know that B n — > B in fidi's and an almost sure convergence 
is possible under a special construction (Shorack and Wellner, 1986). 

3.2. QQ plots in the regularly varying case 

Now assume that X\,Xi, . . . , X n are i.i.d. from a distribution F. Suppose we want to check from the sample 
whether F is heavy-tailed or not, more specifically, we want to check whether 1 — F is regularly varying 
with some index — i.e., F € EV_-^u, with £ > 0. Hence in the sense of testing a hypothesis, our null 
hypothesis is that F £ RV^i/^ for some £ > 0, but we really do not have any specific form for F. We define 
the following sets: 



Q„ = {(-log|,log-^) :l<j<k}, k<n, (3.2) 

Q = ((x,&y.x>o\. (3.3) 

The set Q n is the logarithmic version of the QQ plot for the first k order statistics from the sample X\ , . . . , X n . 

We know that under the null hypothesis, Q n — > Q as k,n — > oo with k/n — > from Das and Rcsnick 
(2008). We show henceforth that a distributional convergence can also be obtained in this case. 
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Assumption 3.2. We say that F satisfies condition R if 

lim Vk(^F(b(n/k)y-t) - y) = (3.4) 
locally uniformly on (0, oo] as k,n,n/k — > oo. 

Theorem 3.3. Suppose Xi,X%, ■ ■ ■ ,X n are i.i.d. F, where F € RV^i/^, with £ > and satisfies condition 
R. Then 

ON n := j^-log|,-£log|+Vfc(\og X(3) 

=> ON := {( -log*,-£logt + ^r 1 B{t)^ : < t < l| in 
where B(t) is a Brownian Bridge on [0, 1] restricted to (0, 1]. 

Remark 3.4. The set QM n is a suitably normalized version of the QQ plot which allows us to obtain a 
weak limit. It is important to observe that the format in which we have expressed the result is not standard 
in the literature as far as weak limits of random variables or functions are concerned. Usual weak limit results 
will only consider the normalized difference of the random variable from its mean or its limit in probability. 
In our setting it is imperative to state the result in the form which we have used. We look at the plot as the 
probability limit perturbed by the normalized deviation around it; i.e., we shift the normalized differences 
so that we can obtain the distribution of the deviation of the observed points of the QQ plot from its mean 
position. If we do not make this shift, the weak limit will always hover around the y-axis and will not give 
the deviation from the actual point in the plot. 

Remark 3.5. We have assumed Condition R in order to prove the a weak limit for the QQ plots. Without this 
assumption we can show the convergence of tail empirical measure with unknown centering ^F{b(n/k)y~^) 
as in (3.7) but we wish the centering to be y here. To achieve this 



lim 

n— f oo 



Vk (^F(b(n/k)y~t) - y 



should exist and have a finite limit which we assume to be without loss of any generality. The same theorem 
can be proved by replacing Condition R with the stronger condition of second order regular variation de Haan 
and Ferreira (2006); de Haan and Stadtmueller (1996); de Haan and Peng (1998). Neither condition R nor 
the second order RV condition are easy to check in practice, albeit we resort to assuming them in order to 
obtain distributional limits. 

Proof. We will derive this result from the weak convergence of the tail empirical measure. The tail empirical 
measure is defined as 

1 " 

:= h ^2 e x t /b( n /k){-) (3.5) 



=1 



as a random element of M + (0, oo] where e x (-) puts unit mass at x. By Theorem 4.1 (Rcsnick, 2007, p. 79) 
we get that 

v n v in M + (0, oo] (3-6) 
where v{y, oo] = y^ x ^,y > 0. Furthermore, Theorem 9.1 in (Resnick, 2007, p. 292) gives us 

Vk (v n (y-Z,oc] - ^F(b(n/k)y-^ => W(y) in D[0, oo). (3.7) 

where W is a standard Brownian motion on [0, oo). Since F satisfies condition R we obtain 

Vk(v n (y-t, oo] - yj W{y) in D{0, oo). (3.8) 
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We will use this to find the limiting distribution of 



Vkl log 



(fc) 



ClogtJ =Vfcio g 0jWj, 0<t<l. 



By an abuse of notation let v n {y) '■= v n{y j oo]. Then for < t < 1, 



'(*) := in£{y : v n (y «, oo] > t} = inf I y : ^ e x . /b{n/k) (y c ,oo] > kt \ = [ j^^ 



Xr 



Note that we can apply Vervaat's lemma (Proposition 3.3 in (Resnick, 2007, p. 59)) to (3.8) to 

-t)^W(t) inA(0,l]. 



-VS 



get 



M n / k ). 

Therefore, using the continuous map / : £>/(0, 1] — > A(0, 1] with f(x)(t) = x(t)/t, we have 



-lU? inA(0,l] 



(3.9) 



(3.10) 



Also observe that 



Vfc log 



6(n/jfc) 



^ ] = -\/fc£log 



1- 1- 



X, 



6(n/fc) 

- 1 ) +o P ( Vk£ 



Y \ -US. 

%W1), x 



6(n/fe) 



.6(n/fc) 

So from (3.10) and (3.11) it follows that 

We again use the continuous mapping theorem with / : A(0, 1] — > Di(0, 1] defined as f(x)(t) 
to get the following: 



(3.11) 



(3.12) 

x(t) - x(l) 



Vfclog ( ) = ^log - Vklo g 

\ X {k) J X {k) b{n/k) 



X, 



Xth\ 



Wl)-^ in A (0,1]. 



(3.13) 



We know that tW(l) - W(t) = B(t) on Di[0, 1]. Therefore it is true on a restriction and hence 

•£logt) =K* -1 #(t) in A (0,1] 



>/jfc( log* (rW1) 



(fc) 

where _B is a Brownian Bridge on [0, 1] restricted to (0, 1]. Furthermore, we also get 



- log ^'-^ log ^ 



X 



M log*M+£l og M ] U f-logt.-eiogt+^l in A 2 (0,1] (3.14) 
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using the converging together lemma (Proposition 3.1 in (Rcsnick, 2007, p. 57)) and the fact that 

v/fc^logM-logt^ 

locally uniformly on (0,1]. The weak convergence of the set QAf n follows easily from (3.14). We give the 
argument in the proof of Theorem 4.3 and do not give it here to avoid repetition. □ 



4. Limit results for the ME Plots 

4.1. Empirical ME function for known distribution F 

Suppose Xi, . . . , X n is an i.i.d. sample from distribution F. Yang (1978) studied the properties the empirical 
ME function M(u) in (1.2) as an estimator of M(u). She showed that M(u) is uniformly strongly consistent 
for M(u): for any < b < oo 



P 



lim sup 

n-+oo 0<u<b 



M(u) - M(u) 







Yang (1978) also proved a weak limit for M(u): for any < b < 1 



V^[M(F^(t))-M(F^(t)) 

where U(t) is a Gaussian process on [0, b] with covariance function 

(1 -t)a 2 (t) ~t9 2 {t) 



= 1. 



U(t) 



T(s,t) 



(l-*)(l-i) 2 



for all < s < t < b 



ith 



a 2 (t) = var (XI [t<F ( X )<i]) and 0(f) = E (Y/ [t<F(x) < 1] ) 



Although these properties are stated for the empirical ME function it can be shown using the arguments at 
the end of the proof of Theorem 4.3 that the ME plots also exhibit the same features when the distribution 
F is known. 



4-2. ME plot in the regularly varying case 

Yang (1978) does not explain the behavior of M(u) near the right end point of F. Here we study the 
asymptotic properties of the ME plot when the explicit form of the distribution F is not known. As we had 
discussed for the QQ plots case, suppose we want to test for the null hypothesis that F £ for some 

£ > 0. Ghosh and Resnick (2010) proved the limit in probability of a suitably scaled version of the ME plot 
under this null hypothesis: 

Theorem 4.1. If Xi, . . . ,X n are i.i.d. observations with distribution F satisfying F £ RV—i/^ with < £ < 
1, then in J- 

S n ~^-{(X {i) ,M(X (i) )):i = 2,...,k}^S:= : 1 ^ l }~ ( 4>1 ) 

In this paper we obtain the weak limit of the ME plot when the null hypothesis that F 6 f° r 
some £ > holds . The weak limit depends on the value of £. We get different limits depending on whether 
£ < 1/2, 1/2 < £ < 1 or £ > 1. 
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4.2.1. Case I: < £ < 1/2 

In this case var(Xi) exists and we obtain a Gaussian limit for the suitably normalized ME plots. The 
following assumption is essential to prove the limit. This is comparable to the Assumption 3.2 that was 
required to obtain the weak limit of the QQ plot. As we discussed in Remark 3.5 it is very difficult to check 
this condition in practice. 

Assumption 4.2. We say that F satisfies condition R' if 

Km Vk(yF(b(n/k)y-t) - y) = 

locally uniformly on (0, 00] and 

-F(b{n/k)y) - y~ 1/6 dy -> 

as k, n, n/k — > 00. 

Theorem 4.3. Suppose X\, . . . ,X n are i.i.d. observations from distribution F satisfying F G RV-i/£ with 
< £ < 1/2 and condition R' . Then 



i = 2,...,kj 

MM := I (V« + &-( 1+ ®B(t), Y^t^ + J q y~ {1+C ] B{y)dy^j , < t < 1 J in F, 
where B(t) is the standard Brownian bridge on [0, 1] restricted to (0, 1]. 

Remark 4.4. Similar to Theorem 3.3 we look at the ME plot as the probability limit perturbed by the 
normalized deviation around it and obtain a weak limit in Theorem 4.3. The assumption that £ < 1/2 is 
essential. Note that 

t />oo />oo /"S/ - 1 ft 

y- (1 +^W(y)dy = / W{u~ 1/ t)du= / / dW(s)dy = / s^dW(s). 
Jt-z Jt-"' Jo Jo 

and it well known that the last integral exists if and only if J Q s~ 2 ^ds < 00 for which it is necessary and 
sufficient to have £ < 1/2, cf. (0kscndal, 2003, Lemma 3.1.5, p. 26). This means 

y- (1+ ^B(y)dy £ f y-^W(y)dy - W(l) [ y^dy 




exists if and only if £ < 1/2 and the same is true for the limit M.M . 
Proof. Consider a functional form of the ME plot: 



n=te^):=f^,^M))y te(0)1] (4 . 2) 



as random elements in Df(0, 1]. Following the proof of Theorem 3.2 in Ghosh and Resnick (2010) we know 
that £„(•) -A S(-) in Df(0, 1], where 



s(t) = (t-z iY ^-t-z) 



for all < t < 1. 
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Applying Vervaat's lemma (Rcsnick, 2007, Proposition 3.3, p. 59) to (3.8) we get 

(^k ( ^ - *J - V^KGT*, oo] (-W(t),W(t)) in Df(0, 1]. 

By an application of the Delta-method to (4.3) we obtain a weak limit of the first component 
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£t- (1+ ^W(t) in A (0,1]. 



This means that 

v/fc(s«(i)-SW(*) 



Vk 



X (k) 



X [k) Vk {b(n/k) J X {k) V * 



X, 



( fe ) 



b{n/k) 



i r 



Next we consider the second component of S n and observe that 

M(X (m) ) k 



(4.3) 



(4.4) 



(4.5) 



(4.6) 



X, 



(fe) 



r**l - 1 ./*< 



(r**i)/ b («/ fc ) 



P n (x, oo]dx, 



where 



(4.7) 



1 

i=l 

Using (3.8) and the converging together lemma (Rcsnick, 2007, Proposition 3.1, p. 57) we know that 

(^(^ 1/? ),1) in£>(0,oo] x R. 

Using the fact that the map T : D(0,oo] x R — > D(0, oo] defined by T{g,x)(t) — g(tx) is continuous at 
g € C(0, oo] we get 

(z>„(y,oo] - (y 



b(n/k) 



b(n/k) 



X, 



-i/e 



b{n/k). 



in D(0,oo] 



Furthermore, (3.8) implies 

H n (y) := Vk(p n (y, oo] - y~ 1 /^ W(y-W) - y~^W{l) L B(y-^) =: (4.8) 
Define the maps T and Tk from from _D[0, oo) to D[l, oo) by 



r(/)(t) 



f(x)dx and T«-(/)(t) 



f(x)dx. 



(4.9) 



We understand T(f)(t) = oo if / is not integrable. By (4.8) and the continuity of the map Tk we get that 
Tk{Hu) => T K (H). We also claim that for any e > 



lim lim sup P 



\T K (H n )-T(H n )\\ >e 



0. 



(4.10) 



Note that for any e > 



lim lim sup P 

A"->oo n _>oo 



< lim lim sup P 



< lim lim sup P 
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\\T K (H n )-T(H n )\\ >e 

(if n (y t oo] ~ y~ l/i )dy 



Vk 
Vk 



K 



K 



> € 



V n y 



x, 



b{n/k) 



, oo 



lim lim sup P 



,-vs 



K 



Y \ -V4 



b(n/k) 



b(n/k) 
l] 



> e/2 



> e/2 



Using (3.9) and the assumption that £ < 1/2 we get 



lim lim sup P 

K— >00 „_i.oo 



A" 



(*■) 



6(n/fc) 



-i/S 



> e/2 



Using a change of variable we obtain 



lim lim sup P 

K—tOO ji — yr^Q 



Vk 



(&) 



K 



< lim lim sup P 

K— >oo „j m 



KX {k) /b(n/k) 



b{n/k) 



y \ -1/0 



6(n/jfc) 



6(n/fc) 



Now fix any r\ > and note that 

lim P 

Therefore, 



(fe) 



b(n/k) 



> r\ or 



&(ra/fc) 



X, 



(AO 



> »7 



(fc) 



= 0. 



> e/2 
> e/2 



lim lim sup P 



< lim lim sup P 



KX (k) /b(n/k) 

(1 + r/)Vfc 



1-77) V 



M, Ool — It 1 ' ^ 



U, 00 — li 



6(n/fc) 
X (fe) 



> e/2 
> e/2 



o(l) 



Now since F satisfies condition R' it suffices to show that 



lim lim sup P 

K— S-oo n _»~, 



(z/„(y,oo] - -F(b(n/k)y))dy 



> 



2(1-7?) 



= 



12 



(4.H) 



This can be easily proved using the arguments in the proof of Proposition 9.1 in (Resnick, 2007, p. 296). 
Observe that 



K(l-v) 



(u n (y, oo] - jF{b(n/k)y))dy 



> 



2(l-r?) 



< P 



1 f°° n - e 

Tk j K(1 _ v) E l^/6(n/*,(y, oo] - F(b(n/k)y)\dy > 2^ 
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( e \ 

< 



e \ / n 



, . var[e Xi/b{n/k) (y,oo}]dy 

2(1 -V)J JK(l-n) k 



OO 

n 



The last limit follows from Karamata's Theorem, c.f. (Resnick, 2007, p. 25). Since £ < 1, the integral in the 
last expression is finite and therefore (4.11), and hence (4.10) holds . From Theorem 3.5 in (Resnick, 2007, 
p.56) we get T{H n ) => T{H) = B(y-Vf )dy in C(0, oo]. 

Now consider the random element Y n in the space Dj(0, 1] x C(0, oo], 



r„:=(^,T(tf„)Y 

V -A-(fe) / 

From what we have obtained so far it is easy to check that Y n Y, where 

Y(t,u) = (t-t, f°° B{y-W)d y ). 

J U 

The map f : D t (0, 1] x C(0, oo] -> D/(0, 1] defined by 

f(f,g)(t)=g(f(t)) for allO <t<l 
is continuous at (/, g) € C(0, 1] x C(0,oo] and therefore 

f(Y n )(t) = Vk (i) n (y,^}-y- 1 ^)dy^ B(y- l ^)dy in A (0,1]. 

Jx (M) /x (i) v 7 Jt-« 

This gives us the weak limit for the second component of S n 

Vk(SW(t) - gW(t)) = Vfc ( M( ^ 1)} - j4g^) ^\ j™B{y-Vt)dy inA(0,l]. (4.12) 

We proved the weak convergence of the two components of S n separately. Both these limits were obtained 
using transformations on v n and the weak limit of v n . Hence we get the weak convergence of S n 

Vk{S n (t) - S(t)) U~^B(t), - jH B(y~ 1 ^)dy^j in Df(0, 1]. 

It is then easy to check that 

^- M B(t), \ J^B{y-^)dy\ ± Ur^B{t),\ j\-^B(y)dy^ in Df (0,1]. 



Also observe that 
S n (t) 



^(M) _ fMV^ ^(*(r*«p) g /"My 



x (fc) y k J ' x {k) l - £ y fc 

S(t) := (V«, T^^ ? ) + K*- (1+e) S(t), I jT* 2/-( 1+ «)B(y)^ in A 2 (0, 1], 
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since 



\kt\ 



locally uniformly on (0, 1] 

Now we adapt this result to convergence of the ME plot and prove the statement of the theorem. By 
Lemma 2.3 it suffices to show that 



lim E 

n— >oo 



h y (MN n ) = E h y (MAf) 



for any continuous function h : M 2 H> R + with a compact support. So take any such function h. By the 
Skorokhod representation theorem there exists a probability space (Q,,Q,P) and random elements 5* and 
S* in D, 2 (0, 1] such that 



in the sense of fidi and 



Now observe that 



S* = S n and S* = S 



S* n ^S* in D? (0,1] P-a.s. 



h v {MAf)= sup h(x)= sup h(S*(t)) and h v {MAf n ) = sup h(x) = sup h(S* n {t)). 

x£MAf 0<t<l xeMM n l/fe<t<l 

Since S*(t) is continuous, k — > oo and h is continuous with a compact support, we get 

sup h(S*(t))^ sup h(S*(t)) P-a.s. 

l/fc<t<l 0<t<l 

and since h is bounded we apply the dominated convergence theorem to get 



E h V {MN n ) 

Hence the proof is complete. 



E 



sup h(S* n (t)) 
i/fc<t<i 



E 



sup h(S*(t)) = E h v (MAf) 
o<t<i - 1 L 



□ 



4.3. Case II: 1/2 < £ < 1 

When 1/2 < £ < 1, the distribution F admits a finite mean but not a finite variance. The ME function 
though exists and we know the limit in probability of the scaled ME plot from Theorem 4.1. 

Assumption 4.5. We say that F satisfies condition R" if 

lim Vk(yF(b(n/k)y-t) - y) = 

locally uniformly on (0, oo] and 



1 fkb{n/k) x . 



6(n) V 1 - ? 
/or every < m < 1 where for any I < n 

cl/n 



< - r, i 



C/,n '■= n I F^(l — u)du. 
Jo 



(4.13) 



B. Das and S. Ghosh/Exploratory plots in extremes 



15 



Theorem 4.6. Suppose X\, . . . ,X n are i.i.d. observations from distribution F satisfying F € RV^i/^ with 
1/2 < £ < 1 and condition R" . Then 



MM n := 




Vk 



T fx {l) /*\-c\ kb{n/k) f M(x {l) ) £ 



(fe) 



fc^n/fcj i^Afflj L_(iV 

U/ /' 6(n) 1 X (fe) 1-fU/ 



i = 2,...,k 



^MN := | + + *~ ^Vf J ,0 < * < l| in 

where B(t) is the standard Brownian bridge on [0, 1] restricted to (0, 1] and is a stable random variable 
independent of B(t) with characteristic function 



E[e^] = exp | - I ^r(2 - i) cos ^|t|V« 



1 - i sgn(t) tan — 



(4.14) 



Remark 4.7. A very interesting point to note here is that the two coordinates of the weak limit M.M are 
independent. The empirical ME function depends on the sum of the order statistics -XVi), • • ■ ,X^ k y When 
1/2 < £ < 1 this sum is dominated by a very few high order statistics and it turns out that the contribution 
of Xqa to the suitably normalized M(X^)) vanishes in the limit. The proof below formalizes this idea. 

This feature is in stark contrast to what happens in the case < £; < 1/2. In that case all the top k 
order statistics have some contribution to M[Xa.\) in the limit. Hence the two coordinates in the limit are 
obtained from the same Gaussian process and are definitely not independent. 

Remark 4.8. Unfortunately, we are unable to obtain a proper weak limit of the ME plot in the case when 
£ = 1/2. In this case it is known that the weak limit of the suitably normalized sum of the first k order 
statistics is Gaussian, cf. Csorgo et al. (1991). So this would be similar to what happens when < £ < 1/2 
but the problem is that the integral j y~ 2 dB(y) does not exist. It is possible to redefine the ME plot in a 
different way by leaving out a few of the top order statistics and obtain a limit in that case but we did not 
pursue that direction. 

Proof. From Theorem 3 in Csorgo et al. (1986) we know that if I = l n — > oo with l n /n — > then 



(4.15) 



Observe that by Karamata's theorem (Rcsnick, 2007, Theorem 2.1, p. 25) 

(n/l)b(n/l) lb{n/l) 



C, 



1 V (s)/s2 *~ n wo 2 (i-o- i-* 



Choose I — l n such that l/k as n~ >oo. Fix any < u < 1. Then 



Vk 



X 



(\kt\) 



X, 



(fc) 



1 

' Hn) 



(£t-^B(t),S m ) 



i=l 

in Di[u, 1] x 




(4.16) 



where B(t) and S±/^ are as described in the statement of the theorem. The convergence of the coordinates 
Vn(t) and Vn^ of V n (t) follows from (4.5) and (4.15). The asymptotic independence of Vn^ft) and Vn is 
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a consequence of Theorem D in Csorgo and Mason (1985) or Satz 4 in Rossberg (1967). Using (3.8) we get 
that 

rwi 1 



kb{n/k) 



/ ^(y)dy in AM] 



i=[feu]+l 

and since £ > 1/2, kb(n/k)/(b(n)Vk) —> which implies 

e(t):= ^J_i_^ Jf<i) __l_ (t , 

where e A [it, 1] denotes the identically zero function. Furthermore, using Theorem 2 in Csorgo et al. 
(1986) we get 

' [feu] 



') ->0 in AK 1] 



(4.17) 



Vkb(n/k) 



X (i) - (C fc „,„ - <7,,„) => N(0, 1) 



and hence 



[fcu] 



(4.18) 



Combining (4.16), (4.17) and (4.18) and the converging together lemma (Resnick, 2007, Proposition 3.1, 
p. 57), we get an important building block of this proof 



U n {t) ■= (vM(t),U^t),UW,VW) => (tt-^W(t) : 0,0,S m ) mDf[u,l] x 
Next we consider 



(4.19) 



Z, 



kb{n/k) ( M(X {[kt]) ) £ 
b(n) { 



X 



(k) 



GDf[u,l] 



(2) 

and focus on the second coordinate Z\ (t) 



z n 2) (t) 



kb(n/k) 
b{n) 

kb(n/k) 

kb(n/k) 
bin) 

kb(n/k) 
bin) 

kb(n/k) 
tb(n) 



X {k) 



l-£ 



M(x ([kt]) ) _ e 



6(n/fc) 



[fei]-l 



fl-fctl-lWn/AO ^ 6(n/fc) l-£ 



o P (l) 



[fei]-l 



1 y 

(\kt]-l)b(n/k) 
[fct]-l 



(i) 



^-rt _ +0 P (1) 



1 1 



■op(l) 



ic/( 2 )(t) + i[/( 3 ) + ^i 2 )+op(l) 
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where the last equality holds because of condition R" . Therefore, we get 

Z n (t) ^r^B(t),t- 1 S 1/$ ) in Df[u,l}. 

Since the above limit holds for every < u < 1 it holds in £>z(0, 1] as well. The rest of the proof is completed 
following the last part of the proof of Theorem 4.3. □ 



44. Case III: £ > 1 

In this case the distribution F need not have a finite mean and the ME function may not be defined. It 
definitely does not exists if £ > 1. Still the empirical ME plot can have a limit. 

Theorem 4.9. Assume X\. . . . , X n are i.i.d. observations with distribution F such that F € RV-i/£ and 
condition R' holds: 

1. If£> 1, then 



MN n := 



:i = 2,...,k 



) Uw )' b(n)/k 

MM := { (s,t-( 1+ ^B{t), tS lf ^) : t > l} 

in J- , where S\/£ is the positive stable random variable with index l/£ which satisfies for t G 



£[e lt5 v 5 ] =expi -r(l--JcoB^|t| 1 / e [l-isgn(t)tan^ 



and B(t) is a Brownian bridge independent of Si/f. 
2. If ^ = 1 one? fe satisfies k — k(n) 00, fc/n — > 0, and 



then 



MN n ■ = 



kb(n/k)/b(n) — > 1 as n — > 00, 



j%) 

^X (fc) U/ y ' &(n/fc) ib{n) 



:i = 2,...,k 



MM := {t^-B^Si - 1 - log*) : t > l} 



m J 7 , where 



r-kjn 
Jl/n 



Si is a positively skewed stable random variable satisfying 



E[e ltSl ] = expiii f ( 



' sin x 1 
x 2 x(l + x) 



dx - \t\ 



isgn(t) log |*| 



and -B(i) is a Brownian bridge independent of S\ 



B. Das and S. Ghosh/Exploratory plots in extremes 



18 



Proof. The theorem is proved in the same fashion as the previous ones. First we prove the weak limit in the 
functional form of the ME plot and then we infer the the weak limit of the plot as a random set. Define 

S n (t) = < for all < t < 1. 

I V { x (k) 1 ) > b(n/k) \kt\b(n)) m P ar MuJ 

We have already proved the weak limit of Sn (t) and the weak limit of S n 2 \t) is proved in Theorem 3.4 in 
Ghosh and Resnick (2010). The rest of the proof is completed following the lines of Theorem 4.6. □ 



5. Confidence Bounds for the Plots 
5.1. QQ plots 

The limit distribution we obtained in Theorem 3.3 is a linear transformation of {B(t)/t : < t < 1} where 
B(-) is a Brownian Bridge on [0, 1]. Clearly B(t)/t blows up near t = 0. Thus for proper inference purposes in 
terms of constructing confidence bounds we need to truncate away from zero. Hence we will fix < e < 1 but 
very close to zero. Clearly again "Jk(G n (t) — —G(£j\ converges weakly to B(t) jt in Di [e, 1], following notations 

in Section 3. So in the limit we want to find M > such that the probability of P ( max ' B ^' > M ] is 



very small. 

Note that it is easy to check that W(t) := (1 + t)B f^pjj , < t < oo is a Brownian motion on [0, oo) 
when B(t) is a Brownian Bridge on [0, 1]. Thus 

\B(t)\ jgCgk)! \w{t)\ 

max = max t 1 - — = max 

\W{t)\ \W{t)\ 
— max = max , 

I f 7 <t<oo t t>8 t 

where 8 = 

1 — e 

Now we compute the above probability which helps us to calculate the probability that a straight line 
through the origin (which is the intended limit of the QQ plot) actually lies within the confidence bounds 
that we can build with the distributional result we obtained in Theorem 3.3. 

Proposition 5.1. Suppose W(t) is a Standard Brownian Motion on [0, oo). Then for all 5 > and M > 0, 
the following holds: 

\W(t)\ 

t>8 



P f sup J— p 1 > Mj =4^2 [$((4fc + 1)MVS) - $((4fc - 1)MVS) 



fe=i 



Proof. The proof is an application of a result in Doob (1949), see (Shorack and Wellncr, 1986, page 38). It 
says that if W(-) is a simple Brownian motion on [0, oo), then for a, c > and a, j3 > 0, 

oo 

P {-(at + f3)< W(t) <at + b,\ft>0) = l-J2 [e~ 2Ak + e~ 2Bk - e~ 2Ck - e~ 2Dk ] (5.1) 

k=l 

where 

A k = k 2 ab + (k - l) 2 a/3 + k(k - l)(a/3 + ba), B k = (k - l) 2 ab + k 2 af3 + k{k - l)(a/3 + ba), 
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C k = k 2 (ab + a/3) + k(k - l)a/3 + k(k + l)ba, D k = k 2 (ab + a/3) + k(k + l)a/3 + k(k - l)ba. 



Now 



p(sup™ >M 

\t>6 t 

= 1 - P(-Mt < W(t) < M(t), Mt > 5) 

= 1 - P(-M(t -5)- MS - W(5) < W(t) - W(8) < M(t - S) + MS - W(S), Vt > S) 

MS 



(5.2) 



= 1 - 



P(-Mt - (MS + s) < W(t) - W(S) < Mt + {MS - s) yt > 0\W(S) = s)f z (s)ds 



s=-MS 

(conditioning on W(S) and denoting the p.d.f. of Z := W(S) ~ N(0, 5) by f z ) 

MS 



= 1 - 



P(-Mt - (MS + s) < W(t) < Mt + (MS - s) Vt > 0)f z (s)ds. 



(5.3) 



s=-MS 



The last equality results from the stationarity and independent increment property of B.M., {W^t) — W(S) : 

t > S} _L W(8) and {W(t) - W(8) : t > 8} = {W(t) : t > 0}. Now we can use (5.1) on the integrand of (5.3) 
and calculate 



p ( su P ra > M]=i- 

>t>6 t 



MS 



s=-MS 



i - J2 (z~ 2Ak + 



k=l 



where 



A k = ElS - E k s, 



e -2B k _ e -2C k _ e -2D k) 



B k = ElS + E k s, 



fz(s)ds. 



C k = F 2 S - F k s, D k = F 2 + F k s, 

with E k = (2k - 1)M, F k = 2fcM, Vfc > 1. Also note that for any o,kI, and Z ~ N(0, S), 

a 

—a — bS s 



e bs f z (s)ds = e b ' 5 ' 2 



Vs 



- $ 



Vs 



Hence we can compute 

MS 

^ e -2A k , . -21!,. , 

-MS 



(e- 2A "+e- 2B ")f z (s)ds 



MS 



MS 



-2E 2 k S+2E k s 



f z ( S )ds+ I e- 2E i s - 2E ^f z (s)ds 



-MS 

e -2E 2 k S e (2E k ) 2 S/2 



-MS 



e -2ElS e (-2E k yS/2 



MS-2E k 5\ 

Ts J 

MS + 2E k S 

Ts 



4> 



-MS - 2E k S 



Vs 

-MS + 2E k 5 

71 



= [$(-(4fc - 3)MVS) - $(-(4fc - 1)MVS) + $((4fc - 1)MVS) - <S>((4,k - 3)mVS)} 
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Fig 1 . Twice the shaded region gives the required probability. The scale is in terms of M\/6. 



2[$((4fc - l)AlVS) - $((4fc - 3)AlVS)] 



Similarly, 



MS 



( e - 2Ck + e- 2Dk ) f z (s)ds = 2[$((4fc + l)My/5) - $((4fc - l)MVS)} 
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Remark 5.2. Figure 1 depicts the kind of integral we are computing in order to calculate the required 
probability. It is clear the number is very close to 1 if MyS is small. 

We can approximate the above infinite sum by a finite sum which can be optimized depending on our 
choice of M and S. If we fix a small S, i.e, e now we can numerically approximate the quantiles of the 
distribution of sup t>(5 \ w ™ using Proposition 5.1. This is what we use for creating confidence bands for the 
QQ plots in the examples in Section 6. Simulation suggests that considering the first 15 terms of the infinite 
sum is enough to give us approximations correct up to the first 6 decimal places. 



5.1.1. Confidence band for QQ plots 



We will provide a confidence band for a truncated version of the QQ plot defined by Q n in (3.2) where we 
truncate near zero for reasons explained earlier. For some truncation level < e < 1, ideally close to zero, 
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let 

Q^:={(-log|,log-^) :l<i<fcaad|>e}, k < n, (5.4) 

(fe) 

Clearly under this truncation, Theorem 3.3 would imply that QAf £ n =$> QAf £ , where QN e n and QJ\f e are the 
truncations of QN n and QAT respectively. Now for < a < 1, the (1 — a)100% confidence band for the limit 
QQ plot Q £ (Q in (3.3) restricted to t € (e, 1)) is: 

K + ho,v):ve(-^,*¥)\, (5-5) 



where c a: s is the (1 — a)-th quantile of sup — ^ with S = One can calculate c a ^s using Proposition 5.1 

t>8 e 

now. 

An important point to note here is that we are suggesting use the weak limit of the QQ plot to obtain 
the confidence band. In practice even if we have a large data set it will always be finite. A natural question 
that arises here is what is the rate of convergence in these cases. We do not have the answer at the moment 
but all the simulation study that we have done strongly suggest that this method works well. 



5.2. ME plots 

Under the assumption that the distributional tail F € RV-i/£ with £ > we have observed in Section 4 that 
the weak limits we obtain for ME plots are different for the three different cases we considered. 

In the case < £ < 1/2 (limit given in Theorem 4.3) where F has a second moment we obtain a Gaussian 
limit, in terms of Brownian Bridges and finite integrals of Brownian Bridges. In order to convert this result to 
obtain confidence bounds we need to compute boundary crossing probabilities for these Gaussian processes. 
Analytical solution for these probabilities are available for linear boundaries (Doob (1949)), piecewise linear 
boundaries (Potzclbcrgcr et al., 2001) in case of Brownian motion on [0,oo). Non-linear boundaries are 
usually approximated using piecewise linear boundaries. For applying Theorem 4.3 we require such results 
for non-linear boundaries on functionals of Brownian Bridges which is not readily available. Hence we resort 
to Monte Carlo simulation which is described in detail in Section 6. 

For 1/2 < £ < 1 we know that the second moment of F does not exist but the mean exists. Theorem 
4.6 provides the limit here where we have a functional of Brownian Bridge on the first component and a 
Stable distribution on the second component. One feature here is that the normalization required to get 
the limit depends on b(n) and b(n/k), which in turn depends on the distribution function F and is hence 
unknown. These can be estimated respectively with and -XVfc) in practice. Although to replace them 
theoretically we would need to know the joint behavior of sum of top k order statistics with the maxima 
and k — th maxima of the distribution. Results akin to (Resnick, 1986, Section 4) can come handy here. See 
Darling (1952), Chow and Teugels (1978). Using Theorem 5.3 in Darling (1952) we can show that under the 
assumptions of Theorem 4.6 

^n:4((^Y,A-J^y) (5-6) 



i = 2,. 



+ (y/k( x ® - dY*) kX{k) M{X(i)) e f*r f 

^ \X (k) \k) J ' X W \ X (k) l-tVie) , 

where Si/j is independent of B(t) and its characteristic function is of the form 

E[e iX§ i/s] = e lX (l + -jj (e ltx - 1 - itX^-^dt) . (5.7) 
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We can again use simulation to obtain the quantiles of this distribution. 

For £ > 1, F need not have a finite mean and the ME plot does not have any nontrivial, nonrandom limit. 
We have shown weak limits in Theorem 4.9. Therefore, it is not sensible to obtain confidence bands in this 
case and hence is not pursued. 



5.2.1. Confidence band for ME plots 

We need to truncate the ME plot near infinity in this case since the weak limits we obtain (Theorems 4.3 
and 4.6) blow up there (relates to t near in the limit A4Af n ). If S n , as defined in (4.1) denotes the ME 
plot for a sample of size n (with k < n top order statistics under consideration), then for < e < 1 small, 
let denote S n restricted to 1 < j < k with j/k > e. If < £ < 1/2 then using Theorem 4.3 we can give 
the (1 — a)100% confidence band for S e (S in (4.1) restricted to t € (e, 1)) as 

5 - + H : xe -k ) ' y e {-ir> ir) } < (5 - 8) 

where 

c Qi£ = (1 - a)-th quantile of sup £t~ (1+5) .B(i), (5.9) 

e<t<l 

t 

da.e — (1 — a)-th quantile of sup £i _1 / y~( 1+ ^ B(y)dy. 

e<t<l J 


If 1/2 < £ < 1 then we use Theorem 4.6 and its modified form in (5.6) to give the (1 — a) 100% confidence 
band for S e as 

Sa/2,6 ° a /2,e\ ^ /X (1) tf (1 _ a)/2 X (1) d a/2 \ 1 




Vk ' Vk ) V 3 x (k) ' jX( k ) 



where 



d a = (1 — a)-th quantile of S±/£ defined in (5.7). 

The above quantiles are calculated by Monte Carlo for the simulation we report in Section 6.1.2. 

Remark 5.3. Throughout the literature of extreme value theory people always consider the top k order 
statistics where k = k n — > oo and k/n — > 0. The idea is that as the size of data increases we concentrate 
more on the right end point of the underlying distribution. In practice though when you have a data with a 
fixed size, albeit large, it is diffucult to decide on which value of k to choose. The popular solution is to try 
out different values of k which have been exemplified in the Hill plot and the Pickands plot. 

In order to obtain confidence band for the QQ plot and the ME plot along with the problem of choosing 
k we also have to choose e. The choice of e should be such that, for the purpose of drawing any inference, 
we leave out the region where data is sparse. In practice we have to try out different values of e depending 
on the size of the data and the choice of k. 



6. QQ Plot and ME Plot in Practice 
6. 1 . Simulation 

We do a simulation study using the software R to check how well this method of obtaining confidence bounds 
for the QQ plot and the ME plot work. 
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Fig 2. QQ Plot for 50000 i.i.d. Pareto random variables with £ = 0.25. 



6.1.1. QQ plots 

We begin with a simple exercise for Pareto distribution with £ = 0.25 (F(x) = x~ 4 ,x > 1). We simulate a 
sample of size n — 50000 from this distribution and look at the QQ plot for extremes as defined in (3.2), see 
Figure 2. The black line denotes the plot Q n and the brown dotted line denotes the true line Q. We know 
that Q n converges to Q and as we see in the plot the two lines are close expect for the top right corner of 
the plots which correspond to the very large order statistics. We choose 3 different values for k: 2000, 1500 
and 1000 which are large in absolute terms but small compared to the sample size n. 

Following the discussion in Section 5 we know that the variance of the limiting distribution blows up as 
we move towards the extreme order statistics (towards the top-right corner) in the plot. So while obtaining 
a confidence bound we truncate at [e£;J-th order statistic for e = 0.05 and 0.01. The confidence bounds are 
obtained for the 6 cases. The three shades of the colored band signifies the 99%, 95% and the 90% confidence 
band for the plot. As is evident in Figure 2 the true line lies within the bound in all the cases. It is also 
notable that the width of the confidence band increases as k and e decreases. 

We use the result in 5.1 to compute the width of confidence band. Also note that the limit depends on the 
value of £ and while obtaining the width of the band we replace £ by its Hill estimate (Rcsnick, 2007, p. 74). 
We could use any consistent estimator of £ and the choice of the estimator does not seem to be important 
as far as the simulation study is concerned. It is well known that estimating the parameter £ can often be 
extremely tricky, see "Hill-Horror plots" in (Rcsnick, 2007, p. 87). But as far as obtaining confidence bounds 
is concerned we can get past that by using a conservative estimate of £, i.e. a value which we strongly believe 
is not less than the true value of £. 

Next we do a similar study for a right-skewed stable distribution with £ = 2/3(a = 1.5) and mean 0. We 
use the same values of n, k and e. The result is given in Figure 3. Here also we see that the method works 
well and the confidence band contains the true line in all the 6 cases. 

We also try a non-standard distribution for which F^ 1 (x) ~ a; _1 / 5 (l — 10 _1 In a;), < x < 1. This means 



B. Das and S. Ghosh/Exploratory plots in extremes 



k=2000, £=0.05 k=1500, e=0.05 k=1000, e=0.05 




~i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 

2 4 6 2 4 S 01234567 
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Fig 3. QQ Plot for 50000 i.i.d. right-skewed stable random variables with £ = 2/3. 





Fig 5. ME Plot for 50000 i.i.d. Pareto random variables with § = 0.25. 



that F £ RV-4 and therefore £ = 0.5. The exact form of F is given by 

F(x) = ^W{2xe 2 fx~ 5 for all x > 1, (6.1) 

where W is the Lambert W function satisfying W(x)e w ^ — x for all x > 0. Observe that W(x) — > oo as 
ic — > oo and W(x) < log(a;) for x > 1. Furthermore, 



log(a;) logW(a;) 



oo, 



and hence W(x) is a slowly varying function. This is therefore an example where the slowly varying term 
contributes significantly to F. That was not the case in the Pareto or the stable examples. The result of the 
simulation is shown in Figure 4. As expected, the choice of k plays an important role in this case and we see 
that the confidence band contains the true line when we choose k = 1000 and e = 0.01. Although not shown 
in Figure 4 the confidence bands perform better for smaller values of k. 



6.1.2. ME plots 

Figure 5 shows the ME plot obtained from a data simulated from the Pareto distribution with £ = 0.25. The 
6 plots correspond to different values of k (3000, 2500 and 2000) and e (0.1 and 0.075). The black line is the 
observed ME plot and the brown dotted line denotes the limit in probability. Again, the three shades of the 
colored band signifies the 99%, 95% and the 90% confidence band for the plot. Note that the weak limit is a 
functional of the Brownian bridge and depends on £. We estimate £ using the Hill estimator and obtain the 
bounds by simulating 10000 paths from the weak limit. 

A striking feature in all these plots is that they are close to being linear near the bottom left corner 
and becomes quite erratic near top right corner. The reason behind this phenomenon is that the empirical 
ME function for high thresholds is the average of the excesses of a small number of upper order statistics. 
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Fig 6. ME Plot for 50000 i.i.d. right-skewed stable random variables with £ = 2/3. 
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k=3000, £=0.075 k=2500, £=0.075 k=2000, £=0.075 




1 2 3 4 5 1 2 3 4 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 



Fig 7. ME Plot for 50000 i.i.d. random variables with the distribution described in (6.1) f£ = 0.2). 
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When averaging over few numbers, there is high variability and therefore, this part of the plot appears very 
non-linear and is uninformative. Therefore, while obtaining confidence bands it is essential to leave out some 
of the extreme order statistics. We would also like to point out that without the confidence bands it would 
have been difficult to believe that these plots were obtained from a a distribution with tail index 0.25. 

A simulation of ME plot for the right skewed stable distribution with £ = 2/3 is shown in Figure 6. We 
use the band described in (5.10) and estimate the quantiles using simulation. In this case we only provide 
the 95% and the 90% confidence band. The 99% confidence band for the stable is very large and using that 
is not much helpful. 

The next simulation is the ME plot for a sample from the distribution function described in (6.1) and 
the result is given in Figure 7. We use the same values for n, k and e. We see that this method of getting 
confidence bands works well in these cases. 



6.2. An example with a real data 

We study a data set which contains Internet response sizes corresponding to user requests. The sizes are 
thresholded to be at least 100KB. The data set consists of 67287 observations and is part of a bigger set 
collected in April 2000 at the University of North Carolina at Chapel Hill. 




It is often stated that file size data typically exhibits heavy-tails and we observe that is indeed the case 
here. Figure 8 shows various plots from this data set. The sample variance is of the order of 10 13 which 
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suggests that the variance is possibly infinite for the underlying distribution (denote by F). This would 
imply that if F is regularly varying for some £ then we must have £ > 1/2. This is suggested by both the 
Pickands plot and the Hill plot (Figure 8(b) and (c) respectively). The Hill plot is always above 1/2 and the 
Pickands is above 1/2 for most of the range. But it is difficult to get an estimate of £ using these two tools 
since both the plots are highly fluctuating and hence inconclusive. We fit a GPD model with the top 2000 
order statistics using the command 'fit. GPD' in the library 'QRMlib'. It gives an estimate 0.6218 of £ and 
Figure 8(d) plots the estimated F in the log- log scale along with the fitted line. 

We try the QQ plot with data set for k = 2000 and 1300 (top 3% and 2% order statistics approximately) 
and with e — 0.05 and 0.02. The plots give an estimate of around 0.62-0.63 of £. The plots are shown in 
Figure 8(e)-(h). The ME plots for k = 2000, 1300 and e = 0.1, 0.07 are shown in Figure 8(i)-(l) and they also 
suggest a similar range for £. 

We observe that in this example the different methods of understanding the tail behavior of a data work 
very well and all of them are in agreement about the value of £. This is not true in many situations and 
then it is hard to judge which method one should trust. In those cases it is important to have some more 
knowledge about the system from which the data was collected and often that helps in the understanding 
of the data. 

7. Conclusion 

Plotting techniques have always been popular as diagnostic tools for goodness-of-fit of observed data and we 
believe they will remain so because of their visual and intuitive appeal. In this paper we have concentrated 
on two such tools used extensively in the extreme-value literature. A weak law of large numbers have been 
shown previously for both the QQ plots (Das and Rcsnick, 2008) and ME plots (Ghosh and Resnick, 2010) 
considering them as random elements in an appropriate topology. Our contribution in this paper has been to 
provide distributional limits for them. In the case of QQ plots we have also provided an explicit expression 
for confidence bounds (with a truncation to avoid the confidence bounds from blowing up) by using these 
distributional results. In the case of ME plots we have obtained distributional limits in the cases < £ < 1/2, 
1/2 < £ < 1 and £ > 1 separately where the underlying distribution F is assumed to be regularly varying 
with index — 1/£. The case £ = 1/2 is still open. We have produced confidence bounds for the ME plots in 
these cases by Monte Carlo simulation, as explicit expressions for these quantities are not easy to calculate. 
The explicit expressions would involve boundary crossing probabilities for a Brownian Bridge with non-linear 
boundaries. Boundary crossing probabilities for Brownian motion can be approximated using piecewise linear 
boundaries (Potzelbcrgcr ct al., 2001) but we do not know of a nice approximation for the Brownian Bridge 
case, hence we resort to simulation. We have illustrated the confidence bounds in both the cases of QQ plots 
and ME plots with simulated and real data examples in Section 6. The importance of the confidence bounds 
can be understood very clearly from Figure 5. Here we have a simulated data set of 50000 points from a 
Pareto distribution with parameter £ = 0.25. Just looking at ME plot it is not at all obvious that this is 
a heavy-tailed data whereas when the confidence bounds with the e-truncation are drawn, the straight line 
with slope £ = 0.25 remains inside the bounds indicating the true nature of the data. 

Since we are using the limiting distribution to obtain the confidence bounds it is natural to ask what the 
rate of convergence is. We have observed that this method works well in the simulation studies that we have 
done but we have not answered this theoretically. This is currently a work in progress. 

A standing assumption in the results we proved in this paper is that the random variables X n are iid. 
We believe that it is possible to obtain similar results under a more general assumption of stationarity and 
mixing, cf. Rootzen (2009). We intend to look into this further. 

We should also note here that often practitioners use the median-excess plot with the implied meaning 
when £ > 1, i.e., the mean for the distribution does not exist (Embrechts et al., 1997), but we have not 
ventured into this kind of plotting tool. We have also not looked into other kinds of plots used in extremes like 
the Starica plot (Starica, 1999) to determine the right k number of upper order statistics, or the Gertensgarbe 
and Werner plot (Gertensgarbe and Werner, 1989) for determining thresholds over which a data may be 
assumed to be extreme- valued or the more popular Hill plot, Pickands plot (Resnick, 2007) to detect the 
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right value of the extreme- value parameter. Obtaining results in the same spirit as this paper for these other 
varieties of plots are a part of intended future research. 
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